Online Decision Making under Stochastic Constraints
نویسندگان
چکیده
This paper proposes a novel algorithm for solving discrete online learning problems under stochastic constraints, where the leaner aims to maximize the cumulative reward given that some additional constraints on the sequence of decisions need to be satisfied on average. We propose Lagrangian exponentially weighted average (LEWA) algorithm, which is a primal-dual variant of the well known exponentially weighted average algorithm, and inspired by the theory of Lagrangian method in constrained optimization. We establish expected and high probability bounds on the regret and the violation of the constraint in full information and bandit feedback models for LEWA algorithm.
منابع مشابه
Designing a new multi-objective fuzzy stochastic DEA model in a dynamic environment to estimate efficiency of decision making units (Case Study: An Iranian Petroleum Company)
This paper presents a new multi-objective fuzzy stochastic data envelopment analysis model (MOFS-DEA) under mean chance constraints and common weights to estimate the efficiency of decision making units for future financial periods of them. In the initial MOFS-DEA model, the outputs and inputs are characterized by random triangular fuzzy variables with normal distribution, in which ...
متن کاملA New Dynamic Random Fuzzy DEA Model to Predict Performance of Decision Making Units
Data envelopment analysis (DEA) is a methodology for measuring the relative efficiency of decision making units (DMUs) which ‎consume the same types of inputs and producing the same types of outputs. Believing that future planning and predicting the ‎efficiency are very important for DMUs, this paper first presents a new dynamic random fuzzy DEA model (DRF-DEA) with ‎common weights (using...
متن کاملOptimizing a bi-objective vendor-managed inventory of multi-product EPQ model for a green supply chain with stochastic constraints
In this paper, a bi-objective multi-product single-vendor single-buyer supply chain problem is studied under green vendor-managed inventory (VMI) policy based on the economic production quantity (EPQ) model. To bring the model closer to real-world supply chain, four constraints of model including backordering cost, number of orders, production budget and warehouse space are considered stochasti...
متن کاملElectricity Procurement for Large Consumers with Second Order Stochastic Dominance Constraints
This paper presents a decision making approach for mid-term scheduling of large industrial consumers based on the recently introduced class of Stochastic Dominance (SD)- constrained stochastic programming. In this study, the electricity price in the pool as well as the rate of availability (unavailability) of the generating unit (forced outage rate) is considered as uncertain parameters. Th...
متن کاملUsing Genetic Algorithm in Solving Stochastic Programming for Multi-Objective Portfolio Selection in Tehran Stock Exchange
Investor decision making has always been affected by two factors: risk and returns. Considering risk, the investor expects an acceptable return on the investment decision horizon. Accordingly, defining goals and constraints for each investor can have unique prioritization. This paper develops several approaches to multi criteria portfolio optimization. The maximization of stock returns, the pow...
متن کامل